Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data.
نویسندگان
چکیده
DNA sample contamination is a serious problem in DNA sequencing studies and may result in systematic genotype misclassification and false positive associations. Although methods exist to detect and filter out cross-species contamination, few methods to detect within-species sample contamination are available. In this paper, we describe methods to identify within-species DNA sample contamination based on (1) a combination of sequencing reads and array-based genotype data, (2) sequence reads alone, and (3) array-based genotype data alone. Analysis of sequencing reads allows contamination detection after sequence data is generated but prior to variant calling; analysis of array-based genotype data allows contamination detection prior to generation of costly sequence data. Through a combination of analysis of in silico and experimentally contaminated samples, we show that our methods can reliably detect and estimate levels of contamination as low as 1%. We evaluate the impact of DNA contamination on genotype accuracy and propose effective strategies to screen for and prevent DNA contamination in sequencing studies.
منابع مشابه
Detecting and Correcting Contamination in Genetic Data by
DNA sample contamination is a serious problem in DNA sequencing studies, and may result in systematic genotype misclassification and false positive associations. While methods exist to detect and filter out cross-species contamination, few methods to detect within-species sample contamination are available. In this paper, we describe methods to identify withinspecies DNA sample contamination ba...
متن کاملI-38: Chromosome Instability in The Cleavage Stage Embryo
Recently, we demonstrated chromosome instability (CIN) in human cleavage stage embryogenesis following in vitro fertilization (IVF). CIN not necessarily undermines normal human development (i.e. when remaining normal diploid blastomeres develop the embryo proper), however it can spark a spectrum of conditions, including loss of conception, genetic disease and genetic variation development. To s...
متن کاملClinical Relevance of Cytokines Gene Polymorphisms and Protein Levels in Gingival Cervical Fluid from Chronic Periodontitis Patients
Background: Cytokines are suggested to play a role in periodontitis. Objective: To determine and compare the levels of Interleukin-1 beta (IL-1β) and Tumor necrosis factor alpha (TNF-α) in gingival crevicular fluid (GCF) samples amongst healthy individuals and those with chronic periodontitis. Further to compare the GCF cytokine levels in three genotype classes defined by the respective gene p...
متن کاملSequence-based genotyping of hepatitis B virus in general popula-tion
Background: Hepatitis B Virus (HBV) causes acute and chronic liver disease worldwide. HBV has eight genotypes (A to H) which is the reflection of its genome with their characteristic geographical distribution. Each genotype could have different pathogenic and therapeutic characteristics. There have been few records on HBV genotyping in general population from our region. This study aimed to...
متن کاملAncient DNA from Human and Animal
Research on ancient DNA (aDNA) has the potential to enable molecular biologists and archeologists to decipher certain aspects of history by direct looking into the past. However, several major problems in this field limit the applicability of aDNA studies, most importantly contamination with modern DNA and postmortem DNA degradation. In this study we extracted and analyzed aDNA obtained from ~3...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- American journal of human genetics
دوره 91 5 شماره
صفحات -
تاریخ انتشار 2012